Inference and Learning for Active Sensing, Experimental Design and Control

نویسندگان

  • Hendrik Kück
  • Matthew D. Hoffman
  • Arnaud Doucet
  • Nando de Freitas
چکیده

In this paper we argue that maximum expected utility is a suitable framework for modeling a broad range of decision problems arising in pattern recognition and related fields. Examples include, among others, gaze planning and other active vision problems, active learning, sensor and actuator placement and coordination, intelligent humancomputer interfaces, and optimal control. Following this remark, we present a common inference and learning framework for attacking these problems. We demonstrate this approach on three examples: (i) active sensing with nonlinear, non-Gaussian, continuous models, (ii) optimal experimental design to discriminate among competing scientific models, and (iii) nonlinear optimal control. 1 The Principle of Maximum Expected Utility Broadly speaking, utility reflects the preferences of an agent. That is, if outcome o1 is preferred to o2 (i.e. o1 o2), we say that o1 has higher utility than o2. More formally, let o1 o2 denote weak preference, o1 o2 denote strong preference and o1 ∼ o2 denote indifference. Define a lottery to be a random set of outcomes with corresponding probabilities: l = [(o1, p1), (o2, p2), . . . , (ok, pk)], where the probabilities satisfy pi ≥ 0 and ∑k i pi = 1 as usual. Now consider the following axioms: 1. Completeness: ∀o1, o2, we have o1 o2, o2 o1 or o1 ∼ o2. 2. Transitivity: If o1 o2 and o2 o3, then o1 o3. 3. Substitutability: If o1 ∼ o2, then for all sequences of outcomes o3, . . . , ok and sets of probabilities p, p3, . . . , pk for which p + ∑k i=3 pi = 1, we have [(o1, p), (o3, p3), . . . , (ok, pk)] ∼ [(o2, p), (o3, p), . . . , (ok, pk)]. 4. Decomposability: Let Pl(oi) be the probability that outcome oi is selected by lottery l. If for all oi: Pl1(oi) = Pl2(oi), then l1 ∼ l2. 5. Monotonicity: If o1 o2 and p > q, then [(o1, p), (o2, 1−p)] [(o1, q), (o2, 1− q)]. 6. Continuity: If o1 o2 and o2 o3 then ∃p ∈ [0, 1] such that o2 ∼ [(o1, p), (o3, 1− p)]. Using these axioms, von Neumann and Morgenstern [16] proved the following fundamental result showing the existence of utility:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effects of Active Memory Exercises on Intelligence Profile in Students With Specific Learning Disorder

Background: Active memory is the search engine of the mind. Active memory is a cognitive function responsible for preserving instant information, its manipulation, and its use in thinking. This study aimed at investigating the effects of active memory practices on intelligence profiles in students with Specific Learning Disorder (SLD). Methods: This was a quasi-experimental study with a prete...

متن کامل

Position Control of a Pulse Width Modulated Pneumatic Systems: an Experimental Comparison

In this study, a new adaptive controller is proposed for position control of pneumatic systems. Difficulties associated with the mathematical model of the system in addition to the instability caused by Pulse Width Modulation (PWM) in the learning-based controllers using gradient descent, motivate the development of a new approach for PWM pneumatics. In this study, two modified Feedback Error L...

متن کامل

Sparse linear models: Variational approximate inference and Bayesian experimental design

A wide range of problems such as signal reconstruction, denoising, source separation, feature selection, and graphical model search are addressed today by posterior maximization for linear models with sparsity-favouring prior distributions. The Bayesian posterior contains useful information far beyond its mode, which can be used to drive methods for sampling optimization (active learning), feat...

متن کامل

The Effects of Cooperative Language Learning Strategies on Learning Active and Passive Structures among Iranian EFL Learners

This study aims at investigating the effects of cooperative language learning on learning active and passive structures among Iranian EFL students. The participants of the study were 60 high school students that were selected from third grade of Barikbin high school in Qazvin. All of the participants were male. Their level of proficiency was intermediate. Then the participants were divided into...

متن کامل

Neuro-fuzzy control of bilateral teleoperation system using FPGA

This paper presents an adaptive neuro-fuzzy controller ANFIS (Adaptive Neuro-Fuzzy Inference System) for a bilateral teleoperation system based on FPGA (Field Programmable Gate Array). The proposed controller combines the learning capabilities of neural networks with the inference capabilities of fuzzy logic, to adapt with dynamic variations in master and slave robots and to guarantee good prac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009